To Phrase or Not to Phrase – Impact of User versus System Term Dependence upon Retrieval
نویسندگان
چکیده
منابع مشابه
To Phrase or Not to Phrase - Impact of User versus System Term Dependence Upon Retrieval
When submitting queries to information retrieval (IR) systems, users often have the option of specifying which, if any, of the query terms are heavily dependent on each other and should be treated as a fixed phrase, for instance by placing them between quotes. In addition to such cases where users specify term dependence, automatic ways also exist for IR systems to detect dependent terms in que...
متن کاملPhrase structure and dependence
Syntactic structures have been analysed in terms of constituent hierarchies and also in terms of dependency relations. While these two traditions have sometimes been presented as competing with each other, there is no reason in principle why syntactic dependencies and constituent hierarchies should not both be part of the grammar, complementing each other. Indeed, within transformational gramma...
متن کاملPhrase Clustering for Smoothing TM Probabilities - or, How to Extract Paraphrases from Phrase Tables
This paper describes how to cluster together the phrases of a phrase-based statistical machine translation (SMT) system, using information in the phrase table itself. The clustering is symmetric and recursive: it is applied both to sourcelanguage and target-language phrases, and the clustering in one language helps determine the clustering in the other. The phrase clusters have many possible us...
متن کاملAn Efficient Phrase-to-Phrase Alignment Model for Arbitrarily Long Phrase and Large Corpora
Most statistical machine translation (SMT) systems use phrase-to-phrase translations to capture local context information, leading to better lexical choices and more reliable word reordering. Long phrases capture more contexts than short phrases and result in better translation qualities. On the other hand, the increasing amount of bilingual data poses serious problems for storing all possible ...
متن کاملBilingual phrase-to-phrase alignment for arbitrarily-small datasets
This paper presents a novel system for sub-sentential alignment of bilingual sentence pairs, however few, using readily-available machine-readable bilingual dictionaries. Performance is evaluated against an existing gold-standard parallel corpus where word alignments are annotated, showing results that are a considerable improvement on a comparable system and on GIZA++ performance for the same ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data and Information Management
سال: 2018
ISSN: 2543-9251
DOI: 10.2478/dim-2018-0001